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Abstract 


This  paper  presents  an  algebraic  definition  of  Knuthian  semantic 
systems  (K-systems  or  attribute  grammars)  with  both  synthesized  and  inherited 
attributes.     The  approach  is  based  on  the  initial  algebra  semantics  principle 
formulated  by  Goguen,  Thatcher,  Wagner  and  Wright.     Given  a  K-system  semantic 
definition  for  a  context-free  grammar  G,  it  is  shown  how  to  construct  a 
many-sorted  algebra  3C  such  that  the  semantic  mapping  from  G-derivation 
trees  into  their  "meaning"  is  the  unique  homomorphism  from  the  initial 
algebra  T«  of  derivation  trees  intoCJC  .     The  practical   implications  of  the 
algebraic  definition  of  K-systems  are  discussed,  and  the  combined  use  of 
Knuth's  original  formulation  and  the  algebraic  approach  for  the  development 
of  semantic  definitions  is  advocated.     The  usefulness  of  the  algebraic  formu- 
lation of  K-systems  is  demonstrated  by  its  application  to  proving  the 
equivalence  of  K-systems  having  the  same  underlying  grammar.     It  is  shown 
that  such  proofs  may  require  verifying  that  a  K-system  possesses  certain 
properties.     To  this  end,  a  principle  of  structural   induction  on  many- 
sorted  algebras  is  formulated,  justified,  and  applied. 
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1.  Introduction 

The  purpose  of  this  paper  is  to  present  an  algebraic  treatment  of  Knuth's 
approach  to  the  specification  of  the  semantics  of  context-free  languages  [14, 
15,  16],  Knuthian  systems,  hereinafter  called  K-systems,  have  been  called 
declarative  semantics  [6,  7,  31 ,  32],  K-grammars  [1]  and  attributed  grammars 
[3,  13,  20,  21].  The  latter  term  seems  to  dominate  in  the  more  recent  litera- 
ture, but  it  emphasizes  a  syntactic  point  of  view.  We  view  K-systerns  as  Knuth 
originally  intended,  namely  as  a  method  for  assigning  meanings  to  derivation 
trees  on  context-free  grammars.  This  viewpoint  is  in  full  agreement  with  the 
algebraic  approach  to  semantics. 

K-systems  have  been  used  in  many  Computer  Science  investigations,  such 
as  programming  language  translation  [20],  program  correctness  [9],  program 
optimization  [24],  question-answering  systems  [25],  semantics  of  programming 
languages  such  as  SIMULA  67  [31]  and  PL360  [6],  and  programming  language  design 
[1].  Also,  various  formal  models  for  programming  language  syntax  and  translation 
related  to  Knuthian  semantics  have  been  proposed  [5,  17,  23,  30]. 

Goguen  and  Thatcher  [11]  have  given  an  initial  algebra  formulation  of 
K-systems  in  which  only  synthesized  attributes  can  appear.  They  leave  open  the 
problem  of  including  inherited  attributes  in  an  algebraic  formulation  of  K- 
systems.  We  propose  a  solution  to  this  open  problem  in  this  paper.  Our 
solution,  a  formulation  of  K-systems  within  the  framework  of  initial  algebra 
semantics,  combines  the  intuitive  appeal  of  K-systems  with  the  theoretical  power 
of  algebraic  methods.  This  approach  permits  a  simple  and  precise  definition  of 

; K-systems,  and  has  been         used  by  Chirica  [4]  in  proofs  of  correctness 

i 

of  programming  language  translators.  Although  we  argue  that  it  is  possible  to 
'convert  any  K-system  definition  (including  a  circular  one)  into  an  equivalent 
jalgebraic  definition,  we  examine  the  more  useful  alternative  of  using  both 
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methods  for  writing  semantic  definitions.  We  are  thus  freed  from  the  combi- 
natorial aspects  of  K-systems  such  as  tree  traversal,  topological  sorting  and 
especially  the  circularity  problem,  while  taking  full  advantage  of  their 
intuitive  appeal.  A  more  recent  version  of  [11],  [12],  mentions  two  unpublished 
suggestions  for  solving  the  problem  discussed  in  this  paper.  One,  by  Goguen 
and  Zamfir,  is  different  than  our  approach;  the  other,  by  Burstall,  appears 
to  be  more  related  to  our  work. 

In  this  paper  we  assume  a  knowledge  of  K-systems,  many-sorted  (hetero- 
geneous) algebras  [2,  11],  and  Scott's  lattice-theoretic  approach  to  computa- 
tion as  modified  by  Gordon  [10].  Section  2  briefly  reviews  the  initial  algebra 
approach  to  semantics.  The  fundamentals  of  chain-complete  posets  and  K- 
systems  are  discussed  in  Section  3.  Section  4  presents  an  algebraic  formula- 
tion of  K-systems  together  with  discussion  and  an  example.  In  Section  5  we 
discuss  the  practical  implications  of  the  algebraic  definition  of  K-systems, 
where  we  also  argue  in  favor  of  the  combined  use  of  Knuth's  original  formula- 
tion and  the  algebraic  approach  for  the  development  of  semantic  definitions. 
Section  6  demonstrates  the  usefulness  of  the  algebraic  formulation  of  K-systems 
by  applying  it  to  the  problem  of  proving  that  two  K-systems  are  equivalent. 
A  principle  of  structural  induction  on  many-sorted  algebras  useful  in  carry- 
ing out  such  equivalence  proofs  is  formulated,  justified,  and  applied. 


; 
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2.  Notation  and  Mathematical  Preliminaries 

In  this  section  we  briefly  review  the  initial  algebra  approach  to 
semantics  and  the  elementary  theory  of  complete  posets.  In  the  sequel,  e 
denotes  the  empty  string,  $  the  empty  set,  and  N  the  non-negative  integers. 
N  together  with  its  natural  order  relation  is  denoted  w. 
2.1  Initial  Algebra  Approach  to  Semantics 
Many-Sorted  Algebras 

Let^  be  a  set  of  symbols  called  sorts.  An  ^-sorted  operator  domain 
(or  signature)  E  is  a  family  {E   }     .*»  of  sets.  An  element  o  e  Z 
is  called  an  operator  symbol  of  type  <w,s>,  arity  w,  (target)sort  s,  and 
rank  lg(w). 
2.1.1  Definition  (Many  Sorted  Algebra) 

Let  E  be  ani-sorted  operator  domain.  An  (i-sorted)  E-algebra  consists 
of  a  family  X  =  {X  }   «  of  sets  together  with  a  function  aY:  Xw  ->  X   for 

S  S^n  AS 

each  a  e   E   ,  w  e£*>     s  e%.     If  w  =  s-,s9»«»s  ,  where  n>0  and  s.  e^, 

l<i<n,  then  XW  =  Xc  *  Xc  x...xX   ;  when  w  =  e,  Xw  =  {e}.     K 

sl    s2     sn 

The  sets  X  ,  s  e£,   are  called  the  carriers  of  the  E-algebra  X  (we  follow 
the  standard  practice  of  referring  to  a  E-algebra  and  its  family  of  carriers 
:>y  the  same  name).  The  functions  a,  are  called  operations  in  the  algebra  X 
and  are  said  to  have  the  same  type,  arity,  sort,  and  rank  as  the  operator 
symbol  a  with  which  ax  is  associated.  An  operation  g„  of  type  <e,s>,  is  a 
"unction  o^,:   {e}  ->   Xg,  called  a  constant  (or  nullary)  operation.  It  is 
)ften  convenient  to  identify  o.  with  the  element  oAz)   of  X  ;  in  this  case 
'x  is  called  a  constant  of  sort  s.  Many-sorted  algebras  (E-algebras)  are 
ilso  called  heterogeneous  algebras  [2]. 
-.1.2  Definition  (Homomorphism) 

Let  X  and  Y  be  E-algebras.  A  E-homomorphism  is  a  family 


f  When  w  is  a  string,  lg(w)  denotes  the  length  of 
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h  =  (h  :  X,  ■>  Y  }   o  of  functions  (abbreviated  h:  X  -*■   Y)  such  that  for  all 
s   s    s  se ;d 


<w,s>  e,**  xi  ,  o  e  E   ,  and  x  e  Xw, 


hs(ax(x))  =  ay(hw(x))  , 

where  if  w  =  s,s9»»»s  ,  n>0,  s.  e  &  ,   l<i<n,  hw  =  <h  ,...,  h  >  and 

I       n 

h  (x)  =  <h  (x-, ),...,  h  (x_)>.  When  w  =  e,  h  (x)  =  e  and  the  above  equation 
s1  i       sp  n 

reduces  to  h  (ox)  =  Oy.    H 
Initial  Many-Sorted  Algebras 

The  key  property  for  the  application  of  E-algebras  to  programming 
language  semantics  is  given  by  the  following. 

2.1.3  Definition  (Initial  Many-Sorted  Algebra) 

A  Z-algebra  X  is  initial  in  the  category  of  E  algebras  and  E-homomor- 
phisms  if  and  only  if  there  exists  a  unique  E-homomorphism  from  X  to  any  other 
E-algebra.    IS 

2.1.4  Proposition 

For  each  operator  domain  E,  there  exists  an  initial  E-algebra  T„.    K 
The  proof  of  the  existence  of  T„  is  constructive,  and  is  typical  of  word 
algebra  constructions  [  2], 

In  order  to  apply  the  property  of  initiality  to  the  semantics  of 
context-free  languages,  we  define  a  class  of  many-sorted  algebras  associated 
with  each  context-free  grammar. 

Let  G  =  <V.,,  VT,  P,  S>  be  a  context-free  grammar  (CFG),  and  let 
V  =  V.,  u  Vj.  Following  [12],  we  define  a  class  of  many-sorted  algebras, 
called  G-algebras,  associated  with  G.  Define  a  string  mapping  nt:  V*  ■*■  VjJ 
such  that  nt(x)  is  the  string  of  nonterminals,  in  their  same  order,  appearing 
in  x.  Let  II  =  {tt    p  c  P}  be  a  set  of  symbols  in  one-to-one  correspondence 
with  P;  for  convenience,  the  symbols  of  11  win  be  indexed  by  production  numbes. 
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We  define  a  V.,-sorted  operator  domain,  also  called  G  to  identify  it  with  the 
grammar  G.  For  all  A  e  V.,,  w  e  Mt9   let 

Gw,A  =  {7rp  €  n  I  p  £  P'  p  =  A  "*  x'  nt^  =  w} 
and  G  =  {G  .}.  The  elements  of  the  initial  G-algebra  Tr  represent  the 

W,A  b 

derivation  trees  on  CFG  G. 

The  fundamental  precept  of  initial  algebra  semantics  of  context-free 
languages  is  that  any  G-algebra  X  can  provide  a  semantic  definition  for  the 
context-free  language  L(G)  with  (abstract)  syntax  T~;  the  unique  homomorphism 
h:  Tfi  '•*■  X  maps  each  G-derivation  tree  into  its  "meaning"  in  a  syntax-directed 
manner. 
2.1.5  Example 

Consider  the  CFG  G  =  <VN,  V-p  P,  S>,  where  VN  =  {S,  L,  A,  B}, 
Vy  =  {a,  b},  and  P  consists  of 

1.  S  -*  AL  4.  A  ->  aA 

2.  L  ->  LB  5.  A  ->  a 

3.  L  +  B  6.  B  -*  bB 

7.  B  ■>  b 

The  initial  algebra  Tfi  has  carriers  TG  D  for  all  D  e  V,,,  where  T~  D  is  the  set 
of  linear  representations  of  all  D-rooted  derivation  trees  with  terminal 
frontiers.  The  operations  of  T~  are  defined  as  follows.  Let  A  -*■  x  be  the 
p-th  production,  where  nt(x)  =  A,A2»»»A  ,  n>0.  The  corresponding  operation  is 

(Vtg(V".  tn)  =  V-VV'  n>0 

{\\  =  \   '  n  =  °' 
where  ti   c  TQ  A   ,  l<i<n.     Thus     ^^A^rhpi^^e^  7^7^-  e  TG  S  is  the  1inear 
representation  of  the  derivation  tree 
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2.2  Complete  Posets 

In  this  section  we  briefly  review  some  elementary  facts  from  the 
theory  of  partially  ordered  sets  (posets).  Complete  posets  and  continuous 
functions  thereon  are  at  the  foundation  of  the  mathematical  apparatus  re- 
quired to  resolve  certain  recursive  definitions  that  arise  in  Section  4. 
Before  doing  so,  we  remark  that  Scott  [26-29]  proposed  that  continuous  lattices 
should  be  used  in  the  theory  of  computation.  He  offered  a  unified  and  general 
framework  for  studying  recursive  definitions,  generalizing  many  previously 
known  results.  Following  Scott's  approach,  other  authors  [10,  11,  19,  22] 
have  shown  that  Scott's  ideas  can  be  applied  to  more  general  structures  such 
as  posets,  with  fewer  technical  difficulties  and  more  intuitive  appeal. 
Posets 

A  poset  is  a  nonempty  set  P  together  with  a  reflexive,  antisymmetric, 
and  transitive  relation  <  on  P.  A  poset  will  be  denoted  by  its  set  P,  the 
relation  <  being  implicit.  An  upper  bound  of  a  subset  X  of  P  is  any  element 
u  e  P  such  that  x  <,   u  for  all  x  c  L  If  u  <  u'  for  any  other  upper  bound  u' 
of  X,  then  u  is  said  to  be  the  least  upper  bound  (lub)  of  X  in  P.  We  write 

JX,   |x  or  simply  |_J  X  for  the  lub  of  X  in  P  if  it  exists.  If 

P    x73( 

n 

x  =  {x-,,..,,,  x  }  then  we  write   Jx.  to  designate  JX.  For  a  denumerable 

co 

sequence  <x.>.   of  elements  in  P  we  write   Ix.  to  designate  the  lub  of  the 

1  lew  l-r-A  1 

set  U(x-).  If  tne  empty  set  <t>  has  a  lub  in  P  it  is  called  the  least  or  the 
irN  1 
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bottom  element  of  P  and  is  denoted  ip  or  simply  1  when  there  is  no  danger  of 
confusion.  ip  has  the  property  that  ip  <  p  for  all  p  e  P.  A  poset  with  a 
least  element  is  called  strict. 

A  subset  X  c  P  of  a  poset  Pisa  chain  in  P  iff  for  x,  y  e   X,  either 
x  <  y  or  y  <  x  (total  order).  The  number  of  elements  in  a  finite  chain  is 
called  the  length  of  the  chain.  The  height  ht(P)  of  a  poset  P  without 
infinite  chains  is  the  least  upper  bound  (in  u)   of  the  lengths  of  the  chains 
in  P.  A  poset  of  finite  height  is  called  elementary.  The  notion  of  height 
of  an  elementary  poset  has  more  intuitive  appeal  if  we  think  of  it  in  re- 
lation to  the  usual  graph  representation  for  a  poset  (Hasse  diagram),  in 
which  case  the  height  is  the  number  of  nodes  in  the  longest  upward-oriented 
path  in  the  graph. 

A  poset  is  (chain)  complete  iff  every  non-empty  chain  has  a  Tub. 

n 
Since  any  finite  chain  x-,  <  ...  <  x  ,  n  >  0,  has  a  lub   Ix.  =  x  ,  it  fol- 
lows immediately  that  any  elementary  poset  is  complete.  A  chain  complete 
poset  will  be  called  a  domain.  If  a  complete  poset  is  also  strict  (elemen- 
tary) then  it  is  called  a  strict  (elementary)  domain.  We  use  the  following 
facts  in  the  sequel : 

(a).  Any  set  together  with  its  equality  relation  is  an  elementary  domain 
of  height  1 ; 

(b)  For  any  set  A,  the  disjoint  union  A  =  A  u  {i}  with  the  order 
relation  a  <  b  iff  a  =  i  or  a  =  b,  is  a  strict  elementary  domain 
of  height  2; 

(c)  If  A,,...,  A  are  elementary  domains,  then  so  is  A,  x  ...  x  A  ,  and 

n 

ht(A1  x  ...  x  An)  =  i     ht(A.)  -  n  +  1. 
1        n    i=1     l 
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Continuous  Functions  on  Posets 

Let  P  and  Q  be  posets.  A  function  f:  P  ->  Q  is  monotonic  iff  for  all 

p,  p'  e  P,  p  <  p'  implies  f(p)  <  f(p').  A  function  f :  P  ->  Q  is  (chain) 

continuous  iff  for  any  non  empty  chain  X  with  lub  in  P,  the  set  f(X)  has  a 

lub  in  Q  and  f(l  Jx)  =   J f (X) .  The  set  of  all  continuous  functions  from  P 

Y  Q 

to  Q,  denoted  [P  -+  Q],  is  a  complete  poset.  For  e^jery   strict  and  complete 

poset  P  there  exists  a  continuous  function  Fixp:  [P  ->  P]  ■*•  P,  called  the 

least  fixed  point  operator,  defined  as 

00 

Fixp(f)  =  M  fVp) 

where  f  =  lp  (identity  function  on  P)  and  f    =  f  •  f ] .  For  strict 
posets  P  and  Q,  a  function  f:  P  -*-  Q  is  called  strict  if  f(xp)  =  ±q.  In 
the  remainder  of  this  paper,  completeness  means  chain  completeness  and 
continuity  means  chain  continuity. 
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3.  Knuthian  Semantic  Systems  (K-Systems) 

In  this  section  we  review  and  discuss  K-systems  as  originally  presented 
by  Knuth.  We  then  impose  a  few  restrictions  (which  result  in  no  loss  of 
generality)  on  K-systems  which  make  them  more  easily  adapted  to  their 
algebraic  formulation  in  Section  4.  The  use  of  the  word  "domain"  in  this 
section  generally  refers  to  unordered  sets  of  values,  and  not  to  the  domains 
of  Section  2.2. 

The  basic  purpose  of  K-systems  is  the  syntax-directed  specification  of 
a  mapping  from  derivation  trees  on  a  context-free  grammar  to  corresponding 
semantic  objects  (which  can  be  vector-valued).  K-systems  ignore  how  deri- 
vation trees  are  obtained  from  the  terminal  "source"  strings  which  consti- 
tute their  frontier.  Nevertheless,  K-systems  are  sometimes  regarded  as 
defining  a  relation  (since  the  CFG  may  be  ambiguous)  between  source  strings 
and  semantic  objects,  as  depicted  in  Figure  1. 


SOURCE 
STRINGS 


DERIVATION 
TREES 


SEMANTIC 
OBJECTS 


Figure  1  -  Relations  Defined  by  K-systems 

Informally,  K-systems  as  presented  by  Knuth  in  [14]  consist  of  (1):  a 
CFG  G  =  <Vj.,  V-j-,  P,  S>  in  which  S  appears  in  the  left  part  of  exactly  one 
production  and  in  the  right  part  of  no  productions;  (2):  two  disjoint  sets 
of  symbols  associated  with  each  symbol  of  G,  called  inherited  and  synthesized 
attributes,  where  S  must  have  at  least  one  synthesized  and  no  inherited 
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attributes;  (3):  a  semantic  domain  D  for  each  attribute  a;  (4):  a  set  of 

semantic  rules  associated  with  each  production.  Each  semantic  rule  is  a 

function  f  .  :  D   x  ...  x  D   ->  D  which  defines  the  value  of  an  attribute 
pka   a-,         a.    a 

a  of  an  instance  k  of  a  symbol  appearing  in  production  p  in  terms  of  the 

values  of  attributes  a-,,...,  a,  associated  with  other  instances  of  symbols 

in  the  same  production. 

In  [16],  no  attributes  are  associated  with  terminal  symbols.  This  is 
a  reasonable  restriction,  since  the  association  of  attributes  with  terminal 
symbols  is  unnecessary  and  even  bad  because  it  binds  the  semantic  definition 
to  concrete  rather  than  abstract  syntax. 

We  now  discuss  how  the  semantic  rules  are  specified.  Let 

A  ->  aQB1a1...Brar,  r  >  0, 

where  aQ,...,  a  e  Vi  and  A,B,  ,...,B  e  V.,,  be  a  production  (say  the  p-th) 
of  G  (strictly  speaking,  r  depends  upon  p,  but  for  typographical  convenience 
this  dependency  is  not  explicitly  indicated).  The  B.,  l<k^r,  represent 
distinct  nonterminal  instances;  the  same  nonterminal  may  be  represented  more 
than  once.  According  to  [14,  16],  the  semantic  rules  f  .   associated  with 
this  production  must  define 

(i)  the  values  of  all  and  only  the  synthesized  attributes  of  A, 
and 

(ii)  the  values  of  all  and  only  the  inherited  attributes  of  B.  ,  l<k<r. 
To  the  above  requirements  we  add 

(iii)  the  values  of  the  synthesized  attributes  of  A  can  be  defined  only 
in  terms  of  the  values  of  its  own  inherited  attributes  and  syn- 
thesized attributes  of  the  B.  ,  l<k<r; 
(iv)  the  values  of  the  inherited  attributes  of  any  B,  ,  l<k<r,  can  be 
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defined  only  in  terms  of  inherited  attributes  of  A  and  synthesized 
attributes  of  any  B.,  l<j<r. 
We  claim  the  following  about  conditions  (iii)  and  (iv): 

(a)  They  do  not  affect  the  definitional  power  of  K-sys terns  or  their 
convenience  in  practical  use.  A  constructive  and  informal  argu- 
ment presented  in  the  Appendix  shows  that  any  K-system  which  does 
not  satisfy  these  restrictions  can  be  replaced  by  an  equivalent  one 
that  does; 

(b)  Semantic  rules  which  do  not  satisfy  these  conditions  are  unreason- 
able and  infringe  upon  the  very  meaning  and  purpose  of  introducing 
"inherited"  and  "synthesized"  attributes. 

The  language  proposed  in  [14]  for  defining  semantic  rules  consists  of 

(a)  variable  symbols,  denoted  a.A  (Knuth  writes  a(A)),  where  a  is  an 
attribute  symbol  and  A  is  a  nonterminal  instance  within  a  given 
production;  a.A  denotes  attribute  a  of  nonterminal  A.  Multiple 
instances  of  the  same  nonterminal  within  a  production  are  distin- 
guished by  supplementary  marks  such  as  integer  subscripts.  For 
example,  A-,  and  A?  would  both  represent  the  same  nonterminal  A, 
and  a.A,  and  a.A-  both  range  over  D  ; 

(b)  function  (and  constant)  symbol s  of  various  types  and  arities, 

interpreted  as  functions  over  the  semantic  domains  D  .  We  refer 

a 

to  these  functions  as  primitive  semantic  functions  to  distinguish 

them  from  the  semantic  functions  f  ,  ; 

pka 

(c)  terms,  constructed  in  the  usual  way  from  variables  and  function 
symbols. 

Using  this  language,  the  semantic  rules  are  specified  as  sets  of 
equations  of  the  form  "v  =  t"  where  v  is  a  variable  symbol  and  t  is  a  term. 
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Let  x.A  and  y.A  be  vectors  of  variable  symbols  which  denote,  respect- 
ively, the  inherited  and  synthesized  attributes  of  A.  This  notation  is 
not  intended  to  imply  that  each  nonterminal  symbol  has  the  same  set  of 
inherited  attribute  symbols  or  the  same  set  of  synthesized  attribute 
symbols.  In  accordance  with  conditions  (i)  -  (iv)  above,  the  semantic  rules 
associated  with  the  above  (p-th)  production  are 

x.B1  =  F  ,(x.A,  y.B-j,...,  y.Bf) 


(1) 

x.Br  =  Fpr(x.A,  y.B1 »...,  y.Br) 

(2)  y.A  =  G  (x.A,  y.Br...,  y.Br) 


When  r  =  0,  A  ->  aQ  is  called  a  terminal  production;  in  this  case  equations 

(1)  are  absent  and  equation  (2)  reduces  to  y.A  =  G  (x.A).  When  A  =  S,  x.S 

does  not  appear  in  the  above  equations.  A  K-system  in  which  only  synthesized 

attributes  appear  is  called  purely  synthesized. 

F  . (x.A,  y.B,,...,  y.B  )  is  a  vector  of  terms  interpreted  to  define 

semantic  functions  f  ,   for  all  inherited  attributes  a  of  B.  ,  l<k<r.  The 

pka  k 

vector  of  terms  G  (x.A,  x.B,,...,  x.B  )  is  similarly  interpreted  to  define 

semantic  functions  f  n  for  all  synthesized  attributes  a   of  A.  In  algebraic 

pOa        J 

terminology,  the  sets  D  and  the  primitive  semantic  functions  will  form  a 
X-algebra  for  some  specified  operator  domain  E.  The  semantic  functions  are 
therefore  derived  operators  (cf.  [12])  in  the  algebra.  This  raises,  however, 
a  serious  problem  which  will  be  discussed  in  Section  3.4.  We  now  present 
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an  example. 
3.1  Example 

Consider  the  following  K-system. 


Nonterminal 

Inherited 
Attributes 

Synthesized 
Attributes 

Semantic 
Domains 

S 

- 

X 

Dx  =  (a,b>* 

L 

y 

X 

Dy  =  {a,b}* 

A 

- 

X 

B 

- 

X 

Productions 

Semantic  Rules 

1. 

S  +  AL 

x.S  =  x.L   x.A 
y.L  =  x.A 

2. 

h  *■  LzB 

x.L1  =  x.L^y.L^x.B 
y.L2  =  y.L] 

3. 

L  ->  B  . 

x.L  =  y.L  nx.B 

4. 

A1  ->  aA2 

x.A,  =  a^x.A2 

5. 

A  ->  a 

x.A  =  a 

6. 

B1  ■*■  bB2 

x.B1  =  b^x.B2 

7. 

B  ->  b 

x.B  =  b 

The  primitive  semantic  functions  in  this  K-system  are  string  concatenation 
(  )  and  the  string  constants  "a"  and  "b".  The  semantic  objects  are  elements 
of  Ux  =  {a,b}*.  This  K-system  defines  the  source  string-to-semantic  object 
relation 
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{<ambn,  amb1lam...ambV>   |   k,m,n>0,  i  .>0,  Uj<k,     M,  =  n}. 

3  j=l  J 

This  relation  can  be  regarded  as  representing  all  partitions  of  each  positive 

integer. 

3.2  Definition  (Equivalence  of  K-Systems) 

Two  K-sys terns  are  equivalent  if  and  only  if  they  define  the  same 
source  string-to-semantic  object  relation.     K 

3.3  Example 

Consider  the  following  purely  synthesized  K-system,  equivalent  to 
that  of  Example  3.1 . 


Nonterminal 

Inherited 
Attributes 

Synthesized 
Attributes 

Semantic 
Domains 

S 

- 

X 

Dx  =  (a,  b}* 

L 

- 

x,  z 

Dz  =  {a,  b}* 

A 

- 

X 

B 

- 

X 

1 

Productions 

Semantic  Rules 

1. 

S  ->  L 

x.S  =  x.L^z.L 

2. 

Ll  *  L2B 

x.L]  =  x.L2^z.L2  °x.B 

3. 

L  ■*  AB 

x.L  =  x.A*^  x.B 
z.L  =  x.A 

4. 

A,  ■*  aA2 

X«M-i  —  Q    X«Mq 

5. 

A  ->   a 

x.A  =  a 

6. 

B1  ■+  bB2 

x.B,  =  b  ^  x.B2 

7. 

B  ->   b 

x.B  =  b 
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Note  that  the  grammar  underlying  this  K-system  is  not  the  same  as  that 
underlying  the  K-system  of  Example  3.1. 
3.4  Discussion 

The  problem  with  defining  functions  by  terms  in  a  certain  language  is 
that  a  term  does  not  quite  define  a  function  in  the  sense  that  we  do  not 
really  know  the  intended  domain  of  definition  of  that  function.  For  instance, 
the  term  "z+1"  over  integers  may  define  the  function  f:  Z  ->  Z  or  the  function 
f ' :  Z  x  Z  -*■   Z,  where  f  =  Az.z+1  and  f  =  Az-jZ^.z,  +  1.  This  may  not  seem 
very  important,  but  the  concept  of  attribute  dependency  jj_  important  in  K- 
systems  and  it  is  based  on  the  specification  of  the  domain  of  definition  of 
semantic  functions.  According  to  Knuth  [14,  p.  133], 

"(an)  attribute  a  is  defined  to  have  the  value  v  ...  if,  in 
the  corresponding  semantic  rule 

f  .  :  D   x«»-x  D   ->  D 

pja   a,       a.    a 

all  of  the  attributes  a,,...,  a.  have  previously  been  defined 
to  have  the  respective  values  v -,,...,  v.  at  the  respective 
nodes,...  and  v  =  fpj-ot(v1 ,... ,  vt)." 

This  means  that  if  f:  Z  x  Z  ->  Z  is  defined  as  f(z,,z?)  =  z,  +  1 ,  then  f(u,v) 
cannot  be  computed  if  u  is  defined  but  v  is  not. 

In  current  practical  use  of  K-systems,  which  use  the  specification 
language  described  previously,  the  information  about  the  intended  domain  of 
definition  of  semantic  functions  is  not  usually  provided.  Without  the  know- 
ledge about  the  intended  domain  of  definition,  we  cannot  establish  the  graph 
model  of  attribute  dependency  in  the  sense  proposed  by  Knuth.  Most  often 
this  information  is  deduced  on  syntactic  grounds;  e.g.,  "z+1"  defines  a 
function  f:  Z  ->■  Z  of  one  variable  since  there  is  only  one  variable  in  the 
term  "z+1".  This  approach  fails  if  we  wish  to  be  able  to  evaluate  the  term 
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"if  x>0  then  x+1  else  y"  when  x  >  0  and  y  is  undefined. 

The  solution  to  these  problems  is  based  on  the  idea  of  formalizing 
the  concept  of  a  variable  being  defined  or  undefined:  we  simply  add  a 

new  value  1  to  the  sets  D  such  that  an  "undefined"  attribute  has  the  value 

a 

i.  By  extending  the  primitive  semantic  functions  to  D  u  {i}  we  can  say 
that  an  attribute  a,  directly  depends  on  attribute  a~  iff  the  semantic 
function  defining  a,  is  strict  in  the  argument  corresponding  to  ou.  This 
does  not  conform  to  Knuth's  attribute  dependency  definition,  but  seems  to 
be  better  suited  for  practical  applications  where  we  do  not  want  to  specify 
intricate  domains  of  definition.  In  our  definition  all  semantic  functions 
attached  to  the  same  production  will  have  identical  domains  of  definition. 
These  domains  need  not  be  specified  since  they  will  be  implied  by  the  form  of 
the  production.  By  proper  extension  of  the  primitive  semantic  functions 
any  desired  dependency  among  attributes  can  be  achieved  (e.g.,  the  primitive 
semantic  function  Xpxy.rf  p  then  x  else  y  can  be  extended  in  various  ways, 
making  it  strict  in  all  or  some  of  its  arguments). 

The  last  problem  to  be  considered  is  how  to  compute  the  attribute 
values.  This  problem  is  not  given  a  full  formal  treatment  in  [14,  16].  In 
[14]  it  is  indicated  that  well  known  tree  manipulation  algorithms  may  be  used 
for  attribute  evaluation  by  supplementing  these  algorithms  with  the  necessary 
attribute  computation  steps.  The  work  of  Fang  [7],  Bochmann  [3],  and  Kennedy 
and  Warren  [18]  shows  that  this  is  not  a  trivial  task.  The  main  contribution 
of  the  algebraic  definition  given  in  Section  4  is  a  precise  mathematical 
definition  of  this  aspect  of  K-systems.  Furthermore,  the  definition  is 
independent  of  any  tree  manipulation  algorithm. 
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4.  Algebraic  Definition  of  Knuthian  Semantics 

In  this  section  we  propose  an  algebraic  definition  of  K-systems  [14, 
15,  16]  and  discuss  the  implications  and  the  use  of  this  definition  for 
semantics  and  translation  of  programming  languages. 
4.1  Definition  (K-System) 

A  Knuthian  Semantic  System  (K-system)  consists  of: 

(a)  A  context-free  grammar  G  =  <VN,  VT,  P,  S>  in  which  S  occurs  in  the 
left  part  of  only  one  production  and  in. the  right  part  of  no  productions; 

(b)  A  Z-algebra  D  for  a  given  set  of  sorts  &   and  a  given  ^-sorted 
operator  domain  Z.  The  carriers  D  ,  s  e^,  of  D  are  called 
semantic  domains  and  the  operations  f  e  E   ,  <w,s>  e£*   x  %> 
of  D  are  called  primitive  semantic  functions; 

(c)  Two  functions  ~~ :  VN  -> £*     and  _:  VN  -*-,£*,  called  attribute  type 
assignment  rules,  which  associate  with  each  nonterminal  A  e  V. 
two  strings  A  and  A  in^*  called,  respectively,  the  inherited 
attribute  type  and  the  synthesized  attribute  type  of  A.  It  is 
assumed  that  S  =  c  and  S^  f   e; 

(d)  Semantic  functions.  For  every   production  (say  the  p-th) 

f 
A  ->  aQB1a1  ...  Bpar 

in  P,  where  aQ,  a,,...,  a  e  V£,  A,  B, ,...,  B  e  V.,,  r>0,  there 

is  given  a  function  G  of  type  <AB, . . . B  ,  A>  iff  A  j-   e,  and  for 

p         — i   — r  —     — 

every   k,  l<k<r,  there  is  given  a  function  F  .  of  type  <AB, . . . B  ,  B.> 
iff  B.  f  e.     All  semantic  functions  must  be  derived  operators  of 
the  indicated  type  in  algebra  D  in  the  sense  of  [12],    M 

If  A  =  sr..sn  e^+,  then  DA  =  Ds  x...  xD_.   If  A  =  e   then  DA  =  {c}. 

A  1         Sn      A 

The  set  D  is  called  the  inherited  attribute  domain  of  A.  D-,  called  the 


t  For  typographical  convenience,  the  dependence  of  r  on  p  is  not  shown. 
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%^h^L3til^^^  of  A,  is  defined  similarly.  The  functions  Fpk 


are  of  type  <AB]  ...  Bf.,  Bk>,  i.e., 


R  B     B. 

F^XD"1  x...xD^Dk. 
pk 

If  K  -  sr-  V  then  F       is  a  Bk^Ple  of  fUnCti°nS     <FI*1—  F*m>     "^ 

n 

F,,.:  9*xlf'    x  ...  »»^»s..  lfii*  1 

pki  l 

*■  a   c-imiiarlv   Note  that  for  terminal  productions 
The  functions  G  are  defined  similarly.  Note 

j..  „,.  c      and  the  semantic  functions  G 
A  ■>  aQ  there  are  no  semantic  functions  Fpk  and  the  p 

are  of  type  <A,A>. 

4  2     SpecJficjiM^JlIJ^^^^ 

It  is  clear  that  a  specification  language  is  built  into  Definition  4.1, 

The  tasic  idea  is  that  of  assigning  variable  symbols  to  every  nonterminal  ,„- 
stance  in  a  production,  according  to  the  nontenninal  type  as  defined  by  the 
functions  «  and  -  «.     Then  we  use  terms  of  the  appropriate  type  to  speedy 
semantic  functions.     The  specification  language  outlined  here  is  the  same 
a5  the  one  discussed  in  Section  3,  only  here  it  results  naturally  from  the 
Hebraic  definition.     (In  a  sense,  Definition  4.1  formalizes  the  semantics 
of  Knuth's  specification  language  [14].) 
43    ^emlemanUcsJorm^^ 

He  must  now  make  precise  what  attribute  computation  means,  ..,, 
how  to  use  a  K-system  for  semantic  definition  of  context-free  languages. 
The  idea  is  simple  and  is  based  upon  the  following  observation  about  how  a 
Knuthian  definition  (in  the  informa,   sense)  works       For  any  given  nonten.na 
A  £  v   ,  A  f  S,  and  any  given  G-derivation  tree  t*  with  root  A,  the  values 
of  the  synthesized  attributes  y.A  of  node  A  are  obtained  in  a  unic^e  way 
fro,„  the  values  of  its  inherited  attributes  x.A.     If  the  definition  is 
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circular  then  the  value  of  some  attributes  cannot  be  computed  (i.e.,  they 

are  undefined),  but  those  which  are  defined  are  uniquely  obtained  from  x.A. 

This  observation  is  true  first  because  in  any  Knuthian  definition  (in  the 

informal  sense)  if  the  attribute  values  can  be  computed  then  they  are  unique 

[14,  p.  135],  and  second  because  of  conditions  (iii)  and  (iv)  laid  down  in 

Section  3.  The  correspondence  x.A  ^  y.A  is  a  partial  function.  If  we  use 

a  special  value  i  to  represent  undefined  attributes,  then  the  correspondence 

x.A  m-  y.A  is  a  total  function.  Accordingly,  we  assume  that  an  element  i^ 

s 
has  been  added  to  every   carrier  D  of  Z-algebra  D.  We  also  assume  that 

every  D  ,  s  e*5,  is  made  into  a  strict  elementary  domain  (D  )  as  indicated  in 
Section  2.2.  Finally  we  assume  that  the  operations  of  D  have  been  extended 
to  continuous  functions.  In  the  terminology  of  [12],  D  is  a  (chain)  con- 
tinuous Z-algebra.  From  Proposition  4.13  of  [12],  derived  operators  of 
continuous  algebras  are  continuous.  Therefore,  the  semantic  functions  4.1(d) 
are  continuous  functions  on  their  respective  domains. 

In  view  of  the  previous  discussion,  the  semantics  of  a  tree  t  with 

A    A 
root  A  in  grammar  G  is  a  function  a. A  e  [D  ■*■  D-]  which  maps  inherited  attri- 
butes into  synthesized  attributes  of  node  A.  Therefore,  the  problem  is  to 
define  the  correspondence  t  h-o.A  between  derivation  trees  t  and  functions 
a.  This  discussion  applies  to  trees  with  roots  other  than  S.  For  trees  with 
root  S  the  semantics  continue  to  be  y.S  e   D— ,  i.e.,  the  value  of  the 
synthesized  attributes  of  node  S.  We  write  a.S  for  uniformity. 

An  appropriate  tool  for  mapping  derivation  trees  into  their  semantics 
is  provided  by  the  initial  algebra  semantics.  We  define  a  G-algebra  X  with 
carriers  KA,  A  e  V^  and  operations  0p,  p  e  P  such  that  the  unique  homomorphism 
h:  Tg  ->X  maps  derivation  trees  t  into  semantic  objects  o   as  discussed  above. 
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The  choice  of  carriers  has  already  been  made  by  the  previous  discussions, 
i.e.,  KA  =  [D  -*  D-]  for  all  A  e  v  '   In  order  to  complete  the  definition 
of  J£  we  must  define  the  operations  0   ,  for  every  pep. 

Consider  a  production  p  =  At  a0B-,a,...B  a  ,  r  >  0.  The  operation  d     is 

r 

Fk    *k 
a  function  0  :  KR  x  . . .  x  Kg  -*  K»  mapping  <  a.B.  e  [D   -*  D  ]  >  l<k<r 

1         r 
p 

into  a.A  e  [DA  +  [£].  The  attributes  x.A  e  DA,  y.A  e  dA  x.Bk  e  D  k, 

^k 
y.Bk  e  D  ,  l<k<r  must  satisfy  the  equations 

y.A  =  a.A(x.A) 

(3) 

y.Bk  =  a.Bk(x.Bk)  ,  l<k<r. 

in  addition  to  equations  (1)  and  (2)  of  Section  3.  The  equations  (l)-(3) 
uniquely  determine  the  operation  0     as  follows. 

Substituting  (3)  into  (1)  and  (2)  yields 

(4)  x.Bk  =  F  .(x.A,  a.B^x.B^,...,  a.Br(x.Br)),  l<k<r 

(5)  a.A(x.A)  =  G  (x.A,  a.B]  (x.B-,) ,... ,  a..Br(x.Br)). 

Let  <E-,  (x.A), . . .,   E   (x.A)>  be  the  least  fixed-point  solution  of  system  (4) 

with  respect  to  x.Bk,   l^k<r  (see  Section  4.4).     Substituting  it  into  (5)  gives 
a.A(x.A)   =  G   (x.A,  a.B} (E] (x.A) ) ,. . . ,  a.Br(Er(x.A))), 

thus  defining  6     as 
P 

(6a)  d     (a.Br...,  a.Bp)   =  a.A 

=  Xx. A   6  DA   •   Gp(x.A,    a.B^E^x.A)),...,    a.Bf(Er(x.A))). 


t  Note  that  for  A=S,  S  =  {e}  and  thus  K<.  =  [{E}  +  [P]  =  CP. 
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When  r=0  (terminal  production)  we  obtain  the  simpler  definition 

(6b)  ep  -  Gp. 

The  above  construction  may  be  thought  of  as  a  procedure  for  transforming 
a  given  K-system  with  both  inherited  and  synthesized  attribute  into  a  new, 
purely  synthesized  K-system.  This  new  K-system  has  (only  synthesized) 
attributes  a. A,  a.B-,,...,  a.B  and  semantic  rules  defined  by  (6a)  and  (6b). 

Definition  4.1  is  now  completed  by  noting  that  the  construction  of 
the  G-algebra^C  and  the  initial  i ty  of  T~  guarantee  the  existence  of  a  unique 
homomorphism  h:  T~  ->3C,  h  =  {h„   A  e  V.,},  which  maps  derivation  trees  on 
G  into  their  semantics,  where 

hA:  TG>A->  [D*-l£],     A  e  \I{V   A  f   S, 

hS:  TG,S  *  & 

The  reader  might  now  expect  to  see  a  proof  that  the  algebraic  defi- 
nition of  K-systems  is  equivalent  to  Knuth's  original  method  of  semantic 
definition,  perhaps  as  a  theorem  of  the  form 

"For  any  t  e  T~  s,  hs(t)  =  y.S,  where  y.S  e  D-  are  the  synthesized 
attribute  values  obtained  using  Knuth's  method." 

However,  K-systems  as  presented  in  [14,  15,  16]  give  only  an  informal  idea 
about  how  attribute  values  are  to  be  computed.  Consequently,  the  above 
theorem  has  no  mathematical  meaning.  The  purpose  of  the  algebraic  definition 
given  in  this  section  is  to  provide  a  formal  treatment  of  this  aspect  of 
K-systems.  Of  course,  one  may  use  the  necessary  tree  manipulation  algorithms, 
as  suggested  by  Knuth,  and  give  an  algorithm  for  attribute  computation. 
This  algorithm,  which  would  have  to  be  accepted  on  intuitive  grounds,  could  then 
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be  used  as  an  "operational"  definition  of  K-systems.  In  this  case  we  can 

raise  the  question  of  proving  a  theorem  relating  the  algebraic  approach  to 

f 
the  "operational"  approach.  We  leave  this  possibility  open.  Instead,  we 

claim  that  the  algebraic  definition  ij_  the  formalization  of  Knuth's  method. 

4.4  Fixed-point  computation  of  attribute  values 

We  now  examine  the  fixed-point  computation  implied  by  the  solution  of 

equations  (4).  Equations  (4)  can  be  viewed  as  a  system  of  equations  in 
unknowns  x.B,  ,  l<k<r,  with  x.A  as  a  parameter.  Intuitively,  we  are  saying 

that  given  inherited  attributes  x.A  at  node  A  we  must  be  able  to  compute 

the  inherited  attributes  x.B.  at  node  B,  ,  l<k<r.  This  is  the  place  where 

the  hypothesis  about  D  being  a  continuous  algebra  plays  an  important  role. 

A" 
For  a  fixed  x.A  e  D  ,  the  right  hand  side  of  equations  (4)  gives  the  func- 


tions 


defined  as 


x  A       Bl  Br         Bk 

Fpp   Dx...xDr->DK,       l<k<r, 


(7)  F^A(x.Br...,  x.Br)   =  Fpk(x.A,  a.B]  (x.B] ) ,. .. ,  a.Br(x.Br)). 

x  A 
The  functions  F  ,"   are  continuous  and  therefore  the  functions 
pk 

(8)  H  .  =  <FX:\...,  FX'A> 

x.A    pi  '   '  pr 

Bl        Br    Bl        Br 

H  .:  D   x  ...  x  D   ->  D   x  ...  x  D 

X .  M 

A  fl        Fr 

are  continuous  for  eyery   x.A  e  D  .  Let  X  =  D   x  ...  x  D  .  Then 

H  .:  X  -*  X,  as  a  continuous  function,  has  a  least  fixed-point  in  X  defined 

X  •  M 


t  This  is  another  instance  of  a  well  known  problem:  proving  the  equivalence 
of  operational  and  mathematical  semantics.  An  excellent  analysis  of  this 
problem  can  be  found  in  [10]. 
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as 

oo 

(9)  Fi*x<Hx.A>  =  u  h;iAu). 

Therefore  the  solutions  E:  D  ->  X  of  system  (4)  are  E  =  Ax  e  D  .FiXw(H  ). 
We  know  that  with  Knuth's  approach,  even  if  there  is  a  circularity  in  the 
definition,  the  attribute  computation  must  terminate  (unsuccessfully)  with  some 
attributes  being  undefined.  It  is  disturbing  therefore  to  have  introduced 
the  least  fixed-point  function  Fixx  in  equation  (8)  since  this  function,  from 
a  computational  point  of  view,  may  lead  to  a  nonterminating  computation. 

We  now  briefly  argue  that  the  least  fixed-point  computation  always 
terminates  in  a  predetermined  number  of  steps  and,  moreover,  it  suggests  an 
algorithm  for  computing  attributes  in  derivation  trees.  Given  a  continuous 

function  f:  Y  ->  Y  on  any  strict  domain  Y  the  least  fixed-point  of  f,  FiXy(f), 
is  the  least  upper  bound  of  the  ascending  sequence  of  elements  in  Y 

i  =  f°(i)  <  f\±)   <  f2(i)  <  ...0 

This  property  suggests  a  computational  procedure  for  FiXw(f)  (provided  that 
everything  else  is  computable  in  some  sense):  iterate  the  application  of  f 
to  itself,  starting  from  the  element  i  e  Y.  This  computation  may  or  may  not 
terminate,  and  in  this  sense,  Fixv  is  a  partial  function.  However  if  Y  is. 
an  elementary  domain  (has  no  infinite  chains)  then  the  computation  always 

terminates  (if  f^i)  =  f1+1(±)  forborne  i  then  Fix„(f)  =  fU)). 

Bl        Br 
The  domain  X  =  Dx...xDj_s_  elementary  since  it  is  a  cartesian 

product  of  elementary  domains.  If  we  evaluate  the  height  of  X  we  obtain  an 

upper  bound  for  the  number  of  steps  in  which  the  computation  of  Fix„(H  „) 

terminates.  The  sets  D  ,  s  e  »§ ,  have  been  made  into  strict  elementary  domains 

by  adding  the  elements  iQ  ,  and  by  introducing  the  order  relation  <  defined 

s 
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in  Section  2.2.  The  height  of  each  D$  is  ht(D$)  =  2.  If  the  length  of  Bk 

Bk 
is  £.,  l<k<r,  then  the  height  of  D  *  is 

B. 
ht(D  )  =  L    +   1  (the  height  of  (e)  is  1) 

and  therefore  the  height  of  X  is 

r  r 

ht(X)  =  Z  (A.+l)  -  r  +  1  =  E  £.  +  1. 
k=l  K  k=l 


The  longest  chain  in  X  cannot  have  more  than  ht(X)  elements.  Discounting 
l,  which  is  an  initialization  step  in  the  least  fixed-point  computation,  it 


r 


follows  that  the  computation  must  terminate  in  I   =  E  i.  steps.  We  will 

k=l 

see  immediately  that  there  is  no  coincidence  that  i   is  also  the  total 
number  of  all  inherited  attributes  of  B, , . . . ,  B  . 

We  now  carry  out  a  few  steps  in  the  computation  of  FixY(H  .)  given 

A   X  .A 

by  formula  (9).     We  write  H^  AU)k  for  the  k-th  projection  of  H1    .(x),  l<k<r. 
The  first  step  (i=l)  of  the  least  fixed-point  computation  gives 

(10)  Hx.A(l)k  =  Fpk(x'A'  a-Bi(-L)---  o-BrU)),  l<k<r. 

Let  y, .B,  =  ff.B, (.)  and  x, .B,  =  H„  «(x), ,  l<k<r.  The  values  y, .B.  represent 
'   k     k       i   k   x.a   k Ik 

the  values  of  the  synthesized  attribute  of  node  B.  when  all  its  inherited 
attributes  are  undefined  (i.e.,  equal  to  i).  y,.B.  is  not  necessarily  equal 
to  ±,  since  some  synthesized  attributes  may  not  depend  on  the  inherited 
attributes  (they  may  be  constants).  Formula  (10)  indicates  that  the  values 
x-|.B.  are   those  of  the  inherited  attributes  of  node  B.  which  can  be  computed 
from  the  given  x.A  and  y-.-B-,  lfjir.  Of  course,  those  attributes  which  are 
not  (yet)  evaluatable  (i.e.,  they  depend  on  undefined  values)  will  be 
undefined. 
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The  second  step  (i=2)  of  the  iteration  gives 

=   Fpk(x-A>  o.B1(x1.B1),...,  a.Br(xrBr)),  l<k<r. 

2 
Let  y2.Bk  =  a.Bk(xrBk)  and  x2.Bk  =  HxA(±)k,  l<k<r.  The  values  y2-Bk  are 

new  values  of  the  synthesized  attributes  of  Bk  computed  from  the  previously 
obtained  inherited  attribute  values  x,.Bk.  Since  now  some  of  x,.Bk  may  have 
defined  values  (f  i),  we  may  assume  that  more  values  in  y2-Bk  are  defined. 
The  values  y2-Bk  together  with  the  values  x.A  are  used  in  formula  (11)  to 
obtain  new  inherited  attribute  values  x2.Bk. 

Each  successive  iteration  (i  =  3,...)  computes  new  synthesized  attribute 
values  y3-Bk  =  a.B^Xp.BjJ,. ..,  and  new  inherited  attribute  values 
x3'^k  =  Hx  A^'k"*'  '  Eac'1  deration  uses  the  previous  values  to  obtain 
new  ones  while  the  values  x.A  remain  constant.  Thus  we  obtain  the  following 
sequence  of  attribute  values 

(12)  i  <  xrBk  <_  x2.Bk  <.  x3.Bk  <_  ..., 

(13)  x<  yrBk  <  y2.Bk  <y3.Bk  <    .... 

for  e\/ery   l^k^r.  The  inequalities  in  (12)  are  the  order  relation  on  D 

h 
and  those  in  (13)  are  the  order  relation  on  D  .  These  inequalities  result 

from  the  fact  that  the  functions  a.Bk  and  F  .  used  in  the  computation  of  the 

above  sequences  are  monotonic.  This  fact  turns  out  to  be  very  important. 

We  know  that  in  Knuth's  method,  once  an  attribute  is  computed  its  value 

cannot  be  changed,  and  it  is  never  "recomputed"  as  in  our  scheme.  What  we 

need  here  is  the  property  that  once  an  attribute  receives  a  defined  value 
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(i.e.,  =  ±)   it  must  remain  that  way  no  matter  how  many  times  its  value  is 
recomputed.  Only  the  value  of  undefined  attributes  may  change  from  one  itera- 
tion to  another. 

The  inequalities  (12)  and  (13)  guarantee  that  our  computation  has  this 
property.  The  inequality  y-i-B.  1  y2-B.  interpreted  in  the  domain  D-^  means 
that  the  B. -tuples  y-i-Bk  and  y2-Bk  are  such  that  they  may  differ  from  one 
another  only  in  the  undefined  components  of  y-i-B^.  This  also  explains  why  the 
computation  must  terminate;  once  an  attribute  is  defined  it  must  remain  that 
way  in  all  subsequent  steps,  and  there  is  only  a  finite  number  of  attributes. 
Since  the  iteration  is  performed  on  inherited  attributes,  their  number  imposes 
an  upper  bound  on  the  number  of  iterations.  The  same  result  was  obtained 
previously  from  other  considerations. 

The  fixed-point  computation  described  previously  can  be  yery  easily  made 
into  an  algorithm  for  computing  attributes  on  derivation  trees.  The  algorithm 
terminates  even  if  definitions  are  circular.  In  this  case,  some  attributes  will 
remain  undefined.  Unfortunately,  the  remarkable  simplicity  of  an  algorithm 
based  on  the  fixed-point  computation  is  overshadowed  by  its  inefficiency,  so 
presently,  we  see  no  practical  applications  for  such  an  algorithm.  However, 
there  are  some  practical  conclusions  which  can  be  drawn  from  this  approach  which 
are  discussed  in  Section  5. 
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4.5     Example 

Consider  Knuth's  binary  number  K-system  [14,   p.    130]. 
The  original    K-system: 

Production Equation   (2)' Equations   (1)' 

1.  B  ->  0  v.B  =  0 

2.  B  -*  1  v.B  =  2  t  s.B 

3.  L  ->  B  v.L  =  v.B  s.B  =  s.L 

£.L  =  1 

4.  L1  •>  L2B  v.L^   =  v.L2  +  v.B  s.L2  =  s.L^   +  1 

£.L,   =  £.l_2  +  1  s.B  =  s.L, 

5.  N  ->  L  v.N  =  v.L  s.L  =  0 

6.  N  -*■  L1    •   L2  v.N  =  v.L-.   +  v.L2  s.L]   =  0 

^   ♦    La        ""         "X/»   La 

Attribute  s  is  inherited;  v  and  £  are  synthesized.     The  Z-algebra  D  for  this 

K-system  (Definition  4.1)  consists  of  sorts  A  =  {int,  ral },  carriers  D.    .   =  Z 

(the  integers)  and  D     ,   =  Q  (the  rationals),  and  the  operations  of  binary 

ra  i  — 

addition  and  unary  subtraction  on  integers,  binary  addition  on  rationals, 
exponentiation  of  rationals  by  integers  to  yield  rationals,  and  constants 
0,1  e  Z  n  Q.  The  attribute  type  strings  for  each  nonterminal  are 


Inherited 
B"  =  int 

L  =  int 

N  =  e 


The  attribute  valu?  domains  (and  the  variable  names  ranging  over  these 
domains)  are 


Syn 

thesized 

B  = 

ral 

L  = 

ral 

int 

N  = 

ral 

tas  defined  in  Section  3. 
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DB  =  z  (s.B) 

DL  =  i  (s.L,   s.Lr  s.L2) 

$  -  (c>  I 

D§-=Q  (v.B) 

dL  .  Q  x  z  (<v.L,  *.L>.  <v.Lr  t-L^.  <v.L2,  l.L2>) 

qN  »  Q  (v.N) 

The  semantic  functions  need  not  be  .ade  explicit  since  their  do.ain  is  implied 
by  Definition  4.1,  and  they  are  given  by  the  se.antic  equations  interpreted 

in  the  algebra  D. 

I„  order  to  obtain  the  new  purely  synthesized  K-system  and  the  associate 

G-algebra*.  we  first  assume  that  Z  and  Q  have  been  replaced  by  lx  and  Q^ 
and  that  the  operations  of  addition,  etc.  have  been  replaced  by  their  strict 
extensions.     The  carriers  Kft  -  [DA  *  lAL  A  .  ¥„.  of  *are 

KN  =  [{c}  -  QJ  =  Qj. 

where  =  denotes  isomorphism  of  complete  posets.     Ue  use  K„  -  Q,  and  KL  - 
[2    *  Q  ]  X  [2    *  Z  ]  for  convenience,  to  avoid  the  use  of  projection  function, 
.n'its  semantic  equations,  the  new  pureiy  synthesized  K-system  uses  the 
notation      oy.N  .  K,;  VL.  VLV  av.L2  .  Ux  +  Q^S  VL'  °rLl>  VL2  £ 
LZi"Zi];  VB  £   KB- 
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The  new  purely  synthesized  K-system: 

1.  B  ■*  0  ay.B  -  As.O 

2.  B  ■+  1  ay.B  =  As. 2  t  s 

3.  L  ->  B  ay.L  =  Xs.(av.B(s)) 

a..L  =  As.l 

4.  L1  -  L2B  ay.L1  =  As. (av.L2(s+l )  +  ay.B(s)) 

a£.L1  =  As.(av.L2(s+l)  +1) 

5.  N  ->  L  ay.N  =  av.L(0) 

6.  N  -  L]  .  1_2  ay.N  =  ^.L^O)  +  av.L2(Fixz  (-a£.L2)) 

We  now  show  how  to  obtain  the  new  semantic  equation  for  production  6. 
Equations  (4)  corresponding  to  production  6  of  the  original  K-system  are 


(9) 


s.L1  =  0 

s.Lp  =  -ap.L2(s.l_2) 


the  solution  of  which  is 

<E, ,  E2>  =  <0,  Fixz  (-a£.L2)>  e   Z±  x  Z  . 

Thus  a.N  =  a  .L-,(E, )  +  a  .L2(E2),  which  yields  the  desired  equation.     The 
new  semantic  equations  for  the  other  productions  are  obtained  similarly.     The 
associated  G-algebra  operations  are 

8,   =  As.O 

G2  =  As. 2  +  s 

83(v)  =  As.<v(s),   1> 

VV   *!»   v2)  =  ^.^(s+l)  +  v2(s),   £-,(s+l)  +  1> 

e5(v,£)       =  v(o) 

86(v1,   9.]t   v2,   l2)         =  v^O)  +  v2(Fixz   (-&2)), 
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where  v,  v] ,  v2  e  [Z±  ■*  Qj,  SLt  i]t   l^  e   [Z±  ->  Z±],  and  s  e  Z±. 

The  meaning  of  the  string  1101.01,  whose  derivation  tree  is 

t   =  ^6^4^7r4^4^3^2^2^1  ^2^4-^3-^  1^2^  '  iS 

hN(t)  =  e6(e4(e4(e4(e3(e2),e2),e1),e2),e4(e3(e1),e2))  , 

which  evaluates  to  hN(t)  =  23  +  22  +  2°  +  2"2  =  13.25.  Details  are  left 
to  the  reader. 
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5.  Some  practical  considerations 

The  algebraic  definition  of  K-systems  has  some  important  consequences 
concerning  the  methodology  for  developing  semantic  definitions  in  general, 
and  the  use  of  K-systems  in  particular. 

First  of  all,  since  we  are  convinced  that  the  algebraic  formulation 
of  K-systems  is  at  least  as  powerful  as  Knuth's  original  approach,  we  can 
work  directly  in  the  algebraic  framework  as  suggested  in  [12].  There  is  no 
loss  in  power  or  convenience,  and  by  making  evident  the  algebraic  structure 
of  the  semantic  domain,  our  understanding  can  only  gain  from  such  defini- 
tions. Moreover,  the  algebraic  approach  has  none  of  the  "well -definability" 
problems  which  appear  when  using  Knuth's  approach,  and  we  also  benefit 
from  established  proof  techniques  available  in  the  algebraic  method. 

An  even  better  approach  is  the  combined  use  of  both  methods, 
algebraic  and  Knuth's,  for  the  following  reasons.  The  algebraic  method, 
however  precise  and  elegant,  offers  little  aid  to  the  intuition,  since  it 
is  not  always  obvious  which  semantic  algebra  is  required  in  a  particular 
application.  On  the  other  hand,  Knuth's  approach  is  more  intuitive, 
especially  when  computability  is  of  great  concern,  such  as  in  translator 
applications.  In  these  cases  Knuth's  approach  should  help  us  discover  what 
kind  of  algebra  is  needed  in  a  particular  application.  It  is  a  definite 
aid  to  the  intuition  if  we  can  think  in  terms  of  attribute  values  being  passed 
up  and  down  in  derivation  trees  and  write  our  intuition  in  a  precise 
algebraic  manner. 

There  is  an  important  observation  to  be  made  concerning  the  combined 
use  of  the  Knuthian  and  algebraic  approaches.  We  do  not  recommend  first 
writing  a  Knuthian  definition  and  then  trying  to  express  it  algebraically 
as  in  Section  4.3.  This  was  done  only  to  show  that  any  Knuthian  definition 
can  be  given  an  algebraic  formulation,  including  circular  K-systems„  This 
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led  to  recursive  systems  of  equations  which  in  actual  practice  can  and  should 
be  avoided  in  the  interest  of  simpler  definitions. 

The  recursive  nature  of  system  (4)  in  Section  4.3  has  two  origins: 
one  in  the  fact  that  Knuthian  definitions  can  be  circular,  the  other  in  our 
choice  of  carriers  as  K„  =  [D  ■*■  D— ],  A  e  V...  Assuming  carriers  of  this 
form  may  lead  to  recursive  systems  even  though  the  definition  is  not  cir- 
cular. This  happens  because  we  are  assuming  that  e\jery   synthesized  attribute 
of  A  is  a  function  of  all  inherited  attributes  of  A.  In  practice,  however, 
not  all  of  this  functional  dependency  may  actually  exist.  To  determine 
the  actual  dependency,  the  following  definition  is  useful: 

Let  x..A  be  an  inherited,  and  y..A  a  synthesized  attribute  of  A  e  V... 

We  say  that  y..A  depends  on  x..A  iff  there  exists  a  derivation  tree 

with  root  A  such  that  there  exists  a  path  from  x..A  to  y..A  in  the 

•      j 

attribute  dependency  graph  on  that  tree. 
Determining,  for  each  A  e  V.,,  the  dependency  of  the  synthesized  attributes 
of  A  upon  the  inherited  attributes  of  A  requires  an  algorithm  \/ery   similar 
to  Knuth's  K-system  circularity  detection  algorithm.  Once  the  dependencies 
were  determined,  the  algebraic  carriers  could  be  defined  accordingly.  Un- 
fortunately, such  an  algorithm  has  exponential  complexity  [13]  and  therefore 
may  not  be  practical . 

We  advocate  a  different  approach  which  imposes  a  reasonable  additional 
requirement  on  the  K-system  specifier.   In  addition  to  the  requirements  of 
Definition  4.1,  the  specification  of  a  K-system  would  include  for  each 
A  c   V^  a  specification  of  the  dependency  of  the  synthesized  attributes  of  A 
upon  the  inherited  attributes  of  A,  expressed  as  a  directed  graph  d„.  The 
nodes  of  d.  are  labeled  with  the  attributes  of  A,  and  an  arc  <x.A,  y..A> 
is  contained  in  dfl  iff  y...A  depends  on  x..A.  This  information  is  well 

M       J  1 
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known  to  the  system  specifier,  since  he  is  well  aware  of  the  desired  dependency 
between  attributes  in  his  K- system  definition. 

The  dependency  graphs  d„,  A  e  V..,  and  Knuth's  graphs  D  ,  p  e  P  [14, 
p.  135]  can  be  used  to  check  the  consistency  and  completeness  of  a  K-system 
definition  as  follows. 

For  each  production  (say  the  p-th)  A  ->•  aQB-,  •••Brar: 

(a)  Construct  the  directed  graph  union 

d_  =  D  u  dR  u  •••  u  dR  ; 

(b)  Consistency  Check:  d  is  acyclic; 

(c)  Completeness  Check:  If  there  exists  a  path  from  x..A  to  y..A  in  d  , 

then  there  must  exist  an  arc  <x..A,  y-.A>  in  dfl. 

1      J  M 

This  algorithm  has  the  same  complexity  (considerably  less  than  exponential 
[8])  as  a  transitive  closure  algorithm. 

Alternatively,  in  an  algebraic  definition  of  a  K-system,  the  consis- 
tency check  can  be  expressed  as  the  condition  that  the  equation  systems  (4) 
of  Section  4.3  can  be  solved  by  successive  elimination  of  the  variables 
x.B, ,  l<k<r,  without  recourse  to  fixed-point  operations.  In  this  case,  we 


do  not  need  to  extend  the  semantic  domains  D  ,  s  e£  ,  to  complete  posets 

and  the  primitive  semantic  functions  f  e  E  ,  <w,s>  e^  *  x^ ,  to  continuous 

functions. 

5.1  Example 

We  now  repeat  parts  of  Example  4.5  in  light  of  the  foregoing  discus- 
sion. In  addition  to  the  previous  specification,  the  dependency  specification 

is 

.  _  V.N   .    s.L  v.L  Ji.L   .    s.B  v.B 
N      *  L    V  /   *         V 
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This  specification  is  consistent  with  the  semantic  equations  of  the  original 
K-system.  In  particular,  it  indicates  that  Undoes  not  depend  on^L,  and 
therefore,  the  K-system  is  nonrecursive,  as  we  shall  see. 

The  new  purely  synthesized  K-system  and  associated  G-algebra  ft' 
differ  from  what  was  previously  obtained  only  in  the  parts  affected  by  the 
dependency  specification,  namely  the  carriers  of  ft,  and  the  algebraic 

operations  associated  with  productions  3,  4,  and  6.  According  to  the 

dependency  specification,  the  carriers  of  X'  are 


KJ-Q 

K^  =  [Z  ■*  Q]  x  Z 

K£  =  [Z  -  QL 

Ue  now  rederive  the  semantic  equation  associated  with  production  6;  production. 
3  and  4  are  treated  similarly.  Since  IX   does  not  depend  on  s.L,  then 
neither  does  orL.  i.e.,  O..L  .  Z,  and  thus  the  equations  corresponding  to 

(9)  become 

s.L1  ■  0 

{10) 

The  solution  of  (10)   is     <l,  ,h>  •  <0.  V^'  obta,ned  Wlth°Ut  ^"^ 

to  fixed-point  methods,  and  thus  we  obtain  the  foil  owing  purely  synthesized  K-sy:e, 

a  .B  =  Xs.O 
v 

a  .B  =   Xs.2ts 

a  .L  =   \s.(av.B(s)) 
v  v 

o..L  =  1 


1. 

B  ■*"  0 

2. 

B  ■*  1      . 

3. 

L   ♦  B 

4. 

h   *  L2B 

5. 

N  +  L 

6. 

N    >  L]    •   L2 

ov.L,   =Xs.(ov.t2(s+l)   +  ov.B(s)) 


VL1    =  VL2  +  1 
ov.N  -  VL(0) 


av.l)  "  o^L^O)  +  ov.L2(-ort2) 


The  operations  of  the  algebra  X'  are 


9]  =  Xs.O 
e£  =  Xs.2ts 


e'(v)  =  <v,  i> 


8^(vr  £j,  v2)  =  <As.(Vl(s+l)  +  v2(s)),  £]  +  1> 


e^(v,£')  =  v(o) 


e6(vr£i'  v2»  %t)   =  vi(0)  +  V2("£P» 
where  v,  v-. ,  v~  e  [Z  -»■  Q]  and  £',  ll,   £->  s  £  Z. 
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6.  Equivalence  of  K-Systems 

In  this  section  we  demonstrate  the  usefulness  and  elegance  of  the  alge- 
braic formulation  of  K-systems  by  applying  it  to  the  problem  of  proving  two  K- 
systems  equivalent,  in  the  sense  of  Definition  3.2.  Our  method  applies  only 
to  K-systems  that  have  the  same  underlying  grammar.  Carrying  out  such  equiva- 
lence proofs  may  require  proving  that  one  of  the  given  K-systems  possesses 
certain  properties.  To  this  end  we  formulate  an  induction  principle  for 
many-sorted  algebras.  All  of  the  foregoing  concepts  are  illustrated  by  an 
example. 

We  first  present  our  method  for  proving  two  K-systems  with  the  same 
underlying  grammar  equivalent  in  the  sense  of  Definition  3.2.  The  method  is 
given  and  justified  by  the  following  theorem. 

6.1  Theorem 

Let  K  and  M  be  K-systems  with  the  same  underlying  grammar 
G  =  <VN>  VT,  P,  S>,  and  associated  semantic  algebras  7C  and^U,  respectively. 

We  assume  that  K_  =  M_,  i.e.,  the  carriers  of  sort  S  in  algebras  Jv  and  "L 

are  identical.  Then,  K  and  M  are  equivalent  (in  the  sense  of  Definition  3.2) 
if  there  exists  a  homomorphism  e:  ju  -*•  m   such  that  e<-  is  the  identity  function 
on  Ks  (i.e.,  e$  =  1 K  ). 

Proof:  Since  Tr  is  initial,  there  exist  unique  G-homomorphisms  h:  Tp  ■*■ "}(/  and 
g:  Tr  ->aftl>  Since  the  composition  of  G-homomorphisms  is  also  a  G-homomorphism 
[12],  we  have  eoh  =  g,  i.e.,  the  diagram 

TG 


% — >in 
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commutes.  Thus  for  all  t  e  Tf  <.,  es(hs(t))  =  gs(t),  which  implies  for  all 
t  e  lr   c»  hs(t)  =  gs(t)  (since  es  =  1.,  ),  establishing  the  equivalence  of 
K  and  M.    IS 

We  now  illustrate  the  use  of  Theorem  6.1  via  an  example. 


6.2  Example 

Consider  yet  another  K-system  for  the  semantics  of  binary  numbers, 
represented  by 


Product! 

ons 

Semantic  Rules 

1. 

B  ->  0 

v.B  =  0 
£.B  =  1 

2. 

B  ■>  1 

v.B  =  1 
Jl.B  =  1 

3. 

L  ■*  B 

v.L  =  v.B 

4. 

Ll  -  L2B 

v.L1  =  2  *  v.L2  +  v.B 
JI.L-,   =  £.L2  +  1 

5. 

N  •>  L 

v.N  =  v.L 

6. 

N-LrL2 

v.N  =  v.L1   +  v.L2  *  2+(-£.L2) 

This  K-system  is  purely  synthesized  and  has  the  same  underlying  grammar  as 

those  of  Examples  4.5  and  5.1.   Its  E-algebra  D  consists  of  sorts  (int.,  rail , 

carriers  D.  .  =  Z  (the  integers)  and  D  ,  =  Q  (the  rationals),  and  operations 

i n t  ra  I 

u>  ls  -»  +,  *,  and  l  as  in  Examples  4.5  and  5.1.  The  attribute  type  strings 
for  each  nonterminal  are  F=U=N  =  t.,B_=LL=  raj[  int.,  and  N  =  ral .  The 

synthesized  attribute  value  domains  (and  the  variable  names  ranging  over  these 


Page  39 
domains)  are 

D^  =  Q  x  Z  (<v.B,   £.B>) 

[t  =  Q  x  z  (<v.L,   JLL>,  <v.Lls  fc.L|>,  <v.L2,  l.l2>) 

D^-=  Q  (v.N) 

Let  ^Tldenote  the  semantic  G-algebra  associated  with  the  above  K-system;  the 
carriers  of  nt  are  M  =  M,  -  Q  x  z  and  MN  =  Q.  The  operations  of  Ttt.  are 

T1  =  <0,1> 
Tp  =  <1  ,1> 
T3(V,£)    =    <V,£> 

T4(v1,£1,v2,i»2)  =  <2*v1  +  v2,  l}  +   1> 

T5(V,H)  =  V 

T6(v1,il1,v2,£2)  =  v1  +  v2*2+(-£2) 

where  v,v-,,v2  e  Q  and  fc.JL.JU  e  Z. 

We  now  prove  the  equivalence  of  the  above  K-system  and  the  K-system  of 
Example  5.1,  whose  semantic  algebra  is  "J(J   with  carriers  K',  K/,  K'  and  opera- 
tions 9-!,...,  GA  as  defined  in  Example  5.1.  Following  Theorem  6.1,  it  is  suf- 
ficient to  define  a  mapping  e:  Jc'  >Ptyl>   where  e  =  {e..,  e.  ,  eR}  and  e.,  =  1~, 
and  to  show  that  e  is  a  G-homomorphism.  To  this  end,  define  e«:  K\  ■*■   M», 
A  =  N,L,B  as 

eN  =  ]Q 

eL:  [Z  ■*  Q]  *  1   ->  Q  x  Z     where   eL(f,z)  =  <f(0),  z> 

eB:  [Z  v  q]  ->  q  x  z     where   eg(f)  =  <f(0),  1>. 

To  show  that  e  is  a  G-homomorphism,  the  following  analysis  by  cases  must  be 
performed. 
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(al)  SHOW  eB(ej)  =  t] 


LHS  =  eB(Xs.O)  =  <Xs.O(0),  1>  =  <0,  1> 
RHS  =  <0,  1>. 

(a2)  SHOW  eB(6^)  =  x2 

LHS  =  eB(As.2ts)  =  <As.2ts(0),  1>  =  <1 ,  1> 
RHS  =  <1,  1>. 

(a3)  SHOW  eL(e^(v))  =  x3(eB(v)) 
LHS  =  eL(v,  1)  =  <v(0),  1> 
RHS  =  t3(v(0),  1)  =  <v(0),  1>. 

(a4)  SHOW  eL(e^(vr  £],  v2))  =  T4(eL(vr  Z\) ,   eB(v2)) 
LHS  =  eL(Xs.(Vl(s+l)  +  v2(s)>,  SL\   +  1) 

=  <Xs.(v1(s+l)  +  v2(s))(0),  i\  +   1> 

=  <v1(l)  +  v2(0),  i\  +   1> 
RHS  =  t4(Vi(0),  i\t   v2(0),  1) 

=  <2*v1(0)  +  v2(0),  i\  +   1>. 

(a5)  SHOW  eN(e^(v,a'))  =  T5(eL(v,£')) 
LHS  =  eN(v(0))  =  v(0) 
RHS  =  x5(v(0),  V)   =  v(0). 

(a6)  SHOW  eN(e^(vls  l\,  M^  l^))   =  T6(eL(vr  i\) ,  eL(v2,  *,£)) 
LHS  =  eN(Vl(0)  +  V2(-A2))  =  v^O)  +  v2(-Aj) 
RHS  =  TgCv^O),  l\,   v2(0),  Vz) 
=  Vl(0)  +  V2(0)*Z*(-£J). 

Identities  (al),  (a2),  (a3),  and  (a5)  are  clearly  satisfied.  To  show  the 
Isatisfaction  of  (a4)  and  (a6)  it  is  sufficient  to  prove  that 

(11)  (Vz  c   Z)(vi(z)  =  v.(0)  *  2+z), 

where  v.  e  [Z  ->  Q],  and  i=l  for  (a4)  and  i  =  2  for  (a6). 
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6.3  Remarks 

Unfortunately,  equation  (11)  is  not  satisfied  for  aVT_  V-,  ,v~  e  [Z  -*■  Q]. 
However,  we  need  be  concerned  in  (11)  only  with  those  values  of  v,  and  v2  that 
can  occur  within  the  definition  of  the  K-system  of  Example  5.1.  More  particu- 
larly, the  values  of  V-,  in  (a4)  and  v?  in  (a6)  can  arise  only  as  the  result  of 
mapping  L-rooted  parse  trees  t  a   T~  .  into  their  semantic  meanings  via  the 
unique  homomorphism  h':  T«  ->  JC ,  namely  via  h/:  T~  .  ->  K ' .  Viewing  (11) 
as  a  (total)  predicate  on  [Z  ->  Q],  we  must  prove  that  for  all  t  e  T«  ,  >  the 
function  v  =  pr-|(h/(t))  satisfies  (11),  where  pr-,  extracts  the  first  component 
of  h^(t)  (recall  that  K[  =  [Z->Q]  x  Z). 

We  now  state  and  apply  an  induction  principle  which  will  prove  useful 
in  demonstrating  such  results. 

6.4  Theorem  (Principle  of  Structural  Induction  on  Many-Sorted  Algebras) 

Let  E  be  an  ^-sorted  operator  domain,  let  X  be  any  E-algebra,  and  let 
h:  T  -►  X  be  the  unique  E-homomorphism  from  T„  to  X.  Let  P  =  (P.).  c   be  an 
^-indexed  family  of  total  predicates  on  X,  i.e.,  P  is  a  total  predicate  on 
X  for  all  s  c  £  . 

If    (0)   (Vs  e   i  )(Va  e    EE$s)(Ps(ax)) 

and   (1)   (V<w,s>  c   4+  x  A  )(Va  e   ZWjS)(Vx  e  XW)(  A  PW(*)  -   Ps(ax(x))) 

then  (2)  (Vs  c  &   )(Vt  e   T^)  (Ps(h$(t) ) ) , 

w 
where  if  w  =  s-i»«»S  and  x  =  <x, ,. . .  ,x  >  e  X  ,  n>0,  then 

n         1     n 


APW(><)  ^  P,  (Xj  A  ...  A  P   (X  ) 


Proof:  The  theorem  follows  directly  from:   (a)  the  predicates  P  can  be  viewed 
as  a  family  of  sets  P  £  X  for  all  s  c  &   ;  (b)  (0)  and  (1),  if  true,  express 
that  P  can  be  regarded  as  a  subalgebra  of  X;  (c)  h(T  )  is  the  least  subalgebra 

t  We  are  interested  in  e  being  a  homomorphism  from  h'(Tp)  into??!  rather  than 
one  from  yj   to  Yfl  . 
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of  X  (an  easy  consequence  of  the  initial i ty  of  T  );  therefore  for  all  s  e  >&   , 
h(Tv)  £  P  >  the  desired  conclusion.    81 

In  particular,  if  for  all  s  e  J&  ,  P  is  a  total  predicate  on  T    satis- 

fying  (0)  and  (1),  then  P  (t)  is  true  for  all  s  c  &   and  t  e  Tv  ,  since 

f 
h:  T^  ■>  Tj,  is  the  identity  homomorphism.  We  shall  now  apply  Theorem  6.4  to 

complete  the  proof  of  Example  6.2. 

6.5  Completion  of  Proof  of  Example  6.2 

Define  the  following  predicates  on  the  G-algebra  "]£   of  Example  5.1. 

(12a)  PB(v)  =  (Vz  £  Z)(v(z)  =  v(0)  *  2+z) 

(12b)  PL(v,Jl)  =  (Vz  e   Z)(v(z)  =  v(0)  *  2+z) 

(12c)  PN(q)  =  true,  where  v  e  [Z  i>  Q],  £  e  Z,  q  e  Q. 
Theorem  6.4  requires  consideration  of  the  following  cases. 

(bl)  PB(e])  =  (Vz  e  Z)Us.O(z)  =  As. 0(0)  *  2+z) 
=  (Vz  e  Z)(0  =  0  *  2+z)  =  true 

(b2)  PB(e^)  =  (Vz  e  Z)(As.2+s(z)  =  Xs.2+s(0)  *  2+z) 
=  (Vz  e  Z)(2+z  =  (2+0)  *  (2+z))  =  true 

(b3)  Assume  PQ(v)  for  v  e   [Z  ->  Q].  Then 

PL(63(v))  =  PL(v,l)  =  (Vz  c  Z)(v(z)  =  v(0)  *  2+z)  =  true, 

which  follows  directly  from  the  hypothesis. 

(b4)  Assume  PL(v-j,£|)  for  <v-,,J^>  e  [Z  ■+   Q]  *  Z  and  PB(v2)  for  v£  e  [Z  -*•  Q].  Then 

PL(64(v1,aj,v2))  =  PL(xs.(v1(s+l)  +  v2(s)),  fcj  +  1) 

=  (Vz  c   Z)(v-,(z+l)  +  v2(z)  -  (v-,0)  +  v2(0))  *  2+z)  =  true, 

which  follows  readily  from  the  hypotheses. 

Cases  (bl)  through  (b4)  and  Theorem  6.4  thus  establish  that  for  all 


t  The  usual  mathematical  induction  can  be  obtained  from  the  above  principle  by 

taking  J    =   {inth  £    =  {0},  E.   .   =  {succ}  and  s,,  _  =  $  for  all  other  <w,s> 
e,s        int,int  w,s 


Page  43 

t  £  T„  ,  ,  P.(h'(t)),  which  shows  that  pr,(h.'(t))  satisfies  (11)  of  Section  6.3. 
Thus  the  K-systems  of  Examples  5.1  and  6.2  are  equivalent.    0 

In  order  to  prove  the  equivalence  of  the  K-systems  of  Examples  4.5  and 
6.2,  the  algebraic  carriers  Z  and  Q  of  Example  6.2  must  be  extended  to  domains 
Z  and  Q  ,  and  the  primitive  semantic  functions  must  be  replaced  by  their 
strict  extensions,  so  that  the  two  K-systems  are  comparable.  Then  the  mappings 
e  =  {eN,eL,eB}  such  that 

eN=  \ 

V  [zi  ■*  Qi]  x  [zi  *  zi]  "  Qi  x  zi  where  eL(f'J)  =  <^Co),j(o)> 

eB:  [Z±  ->  Q±]  -*  Qi  x  Z±  where  eB(f)  =  <f(0),l> 

can  be  defined.  To  show  that  e  is  a  G-homomorphism  reduces  to  proving  that 

v-,0)  =  2  *  v-,(0) 
(13)     ^(1)  =  ^(0) 

v2(Fixz  (-£2))  =  v2(0)  *  2+(-A2(0)) 

where  the  values  of  <v,  ,l^>,   <v2»£2>  e  [Z  ->  Q  ]  x  [Z  •*■  Z  ]  can  arise  only  as 
the  result  of  mapping  parse  trees  t  e  T~  ,  into  their  semantic  meanings  via 
h.  :  Tp  .  ->   K.  .  Properties  (13)  can  be  verified  via  Theorem  6.4  with  predicates 
PB  on  [Z±   ->  Q±]  and  ?L   on  [Z±  ■>  Q±]  x  [Z±  ■>  Z±],  where 

PB(v)  =  (Vz  e  Z)(v(z)  -  v(0)  *  2+z) 

PL(v,£)  ■  PB(v)  a  (Vz  €   Z)(Vy  c  Z)U(z)  =  £(y)). 

Details  are  left  to  the  reader. 
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7.  Conclusion 

Knuthian  semantic  systems,  or  K-systems,  are  a  useful  and  intuitively 
appealing  method  for  specifying  syntax-directed  semantics  of  context-free 
languages.  We  have  formalized,  within  the  framework  of  initial  algebra 
semantics,  K-systems  containing  both  inherited  and  synthesized  attributes, 
thus  combining  the  intuitive  appeal  of  K-systems  with  the  power  and  elegance 
of  algebraic  methods.  Since  Knuth's  initial  presentation  of  K-systems  was 

informal,  we  have  left  open  the  problem  of  proving  it  equivalent  to  our 

f 
algebraic  formulation. 

Our  formulation  of  K-systems  enables  proof  of  the  properties  of  the 
semantics  of  context-free  languages  (such  as  correctness  of  translators 
and  their  implementation)  to  be  conveniently  carried  out  by  induction  on  the 
(context-free)  syntactic  structure  of  the  underlying  grammar  [4]. 

An  important  byproduct  of  our  formulation  is  a  method  for  transforming 
a  K-system  containing  both  inherited  and  synthesized  attributes  into  an  equiv- 
alent purely  synthesized  K-system.  The  new  purely  synthesized  K-system  has 
the  same  underlying  grammar  as  the  original,  but  its  semantic  domains  are 
functions  from  the  original  inherited  attribute  semantic  domains  to  synthesized 
attribute  semantic  domains. 

This  transformation  generally  requires  the  solution  of  recursive  sys- 
tems of  equations.  Such  a  solution  is  achieved  via  fixed-point  methods  with 
all  their  attendant  mathematical  apparatus,  thus  requiring  the  algebraic 
specification  of  the  given  K-system  to  be  or  extended  to  be  continuous.  For 
practical  purposes  we  advocate  the  use  of  only  those  K-systems  in  which  these 
systems  of  equations  are  nonrecursive,  i.e.,  equations  that  can  be  solved 
by  successive  elimination  of  variables  without  recourse  to  fixed-point 
methods „  We  feel  that  such  nonrecursive  K-systems  are  sufficient  for  the 


t  In  effect,  we  offered  a  "mathematical"  definition  in  the  sense  that  the  algebraic 
semantics  is  independent  of  any  algorithm  for  attribute  computation.  In  contrast, 
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vast  majority  of  practical  applications. 

To  foster  the  use  of  these  more  practical  K-systems,  we  have  suggested 
that  the  specification  of  a  K-system  include  for  each  A  e  V\,  a  specification 
of  the  dependency  of  the  synthesized  attributes  of  A  upon  the  inherited 
attributes  of  A.  The  K-system  can  then  be  checked  for  "consistency"  (the 
K-system  is  nonrecursive)  and  "completeness"  (the  K-system  actually  contains 
all  the  specified  attribute  dependencies)  with  respect  to  this  specification. 

We  advocate  the  use  of  both  Knuth's  original  approach  and  our  algebraic 
formulation  as  a  joint  means  of  designing  and  defining  K-systems  for  partic- 
ular applications.  Our  intuition  is  better  aided  by  the  visualization  of 
inherited  and  synthesized  attribute  values  being  computed  and  communicated 
over  attribute  dependency  graphs  superimposed  on  derivation  trees  (an  advantage 
originally  cited  by  Knuth  [14]),  whereas  the  algebraic  formulation  renders 
our  intuition  precise  and  enables  us  to  use  associated  powerful  proof 
techniques. 

The  usefulness  of  the  algebraic  formulation  of  K-systems  was  demon- 
strated via  its  application  to  proving  the  equivalence  of  K-systems  having 
the  same  underlying  grammar.   It  was  found  that  such  proofs  may  require 
verifying  that  a  K-system  possesses  certain  properties.  Toward  this  latter 
end,  a  principle  of  structural  induction  on  many-sorted  algebras  was  formu- 
lated, justified,  and  applied. 

t  Knuth's  method  (as  well  as  those  in  [3],  [7],  [18])  is  "operational,"  i.e., 
the  definition  is  given  by  a  particular  algorithm.  A  possible  solution  to 
proving  the  two  definitions  equivalent  is  to  explicitely  write  Knuth's  defini- 
tion as  an  algorithm  (program)  and  prove  it  consistent  with  our  mathematical 
specification  using,  for  instance,  Floyd-Hoare  method. 
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APPENDIX 

How  to  Transform  K-Systems  to  Satisfy 
Conditions  (iii)  and  (iv)  of  Section  3 

In  this  appendix  we  present  a  construction  which  transforms  a  K-system 
which  does  not  satisfy  conditions  (iii)  and  (iv)  of  Section  3  into  an  equivaler 
one  that  does. 


Let  x  and  y  be,  respectively,  the  inherited  and  synthesized  attributes 
of  the  given  K-system,  and  let  w  and  z  be,  respectively,  new  (distinct  from 
x  and  y)  inherited  and  synthesized  attributes  with  the  same  value  domains 
as  x  and  y.  Let  G  =  <VN,  VT,  P,  S>  be  the  CFG  underlying  the  given  K-systerr 
and  consider  a  typical  (say  the  p-th)  production 

A-  a0B1a]...Brar  , 

and  the  following  four  cases. 

Case  1  (A  ■+   S,  r  >  0) 

The  given  semantic  equations  in  this  case  are 

(Al.l)     x.Bk  =  F  k(x.A,  y.Br...,  y.Br,  y.A,  x.B^...,  x.Br)  ,  l<k<r 

(A2.1)      y.A  -  G  (x.A,  y.B^...,  y.Br,  y.A,  x.B,,...,  x.Br). 
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Equations  (Al )  and  (A2)  do  not  satisfy  conditions  (iii)  and  (iv)  of  Section 
3.  Replace  these  semantic  equations  by 

(A3a.l)  x.Bk  =  F  .(x.A,  y.Bp...,  y.Br»  w.A,  z.B^...,  z.Br)  ,  1  <k<r 

(A3b.l)  w.Bk  =  y.Bk,  l<k<r 

(A4a.l)  y.A  =  G  (x.A,  y.B^...,  y.Bp,  w.A,  z.B^...,  z.Br) 

(A4b.l)  z.A  =  x.A 

Case  2  (A  =  S,  r  >  0) 

The  given  semantic  equations  are 

(A1.2)    x.Bk  =  F  .(y.B1 , . . . ,  y.Br,  y.S,  x.Bj,...,  x.Br)  ,  l<k<r 

(A2.2)    y.S  =  G  (y.B^...,  y.Br,  y.S,  x.B^...,  x.Br).  . 

The  replacement  for  these  semantic  equations  and  corresponding  production 

S  ->  anB, •••B  a  is.  not  as  simple  as  in  Case  1.  First,  replace  (Al )  and  (A2)  with 

(A3a.2)    x.BR  =  F  .(y.B^...,  y.Br,  w.S,  z.B-, ,..;-,  z.Br)  ,  l<k<r 

(A3b.2)    w.Bk  =  y.Bk  ,  l<k<r 

(A4b.2)    y.S  =  G  (y.Bj,...,  y.Br,  w.S,  z.Bj,...,  z.Br). 

Note  that  there  is  no  counterpart  to  equation  (A4a.l).  Let  T  i   V,.  u  Vj 
be  a  new  distinguished  symbol.  Add  the  following  production  and  semantic 
rules  to  the  new  K-system: 

(A5.2)     T->  S  w.S  =  y.S 

y.T  =  y.S 

Note  that  K-system  fragment  (A5.2)  need  be  added  only  once  to  deal  with  all 
S-productions  (Cases  2  and  4).  Also  note  that  the  former  distinguished 
symbol  S  now  has  inherited  attributes  w. 
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Case  3  (A  f   S,  r  =  0) 

The  semantic  eqaation  associated  with  A  ->  afi,  A  f   S,  is 

(A203)  y.A  =  G  (x.A,  y.A). 

Let  A1  ^  v.,  u  Vj  u  {T}  be  a  new  nonterminal  distinct  from  any  other  new  non- 
terminal introduced  in  the  process  of  transforming  the  original  K-system 
into  one  that  satisfies  conditions  (iii)  and  (iv)  of  Section  3.  Replace 
A  +  aQ  and  (A2.3)  by 

(A6.3)     A  ->  A'         x.A'  =  x.A 

w.A'  =  y.A' 


y.A  =  y.A 


(A7.3)    A'  ->  aQ       y.A'  =  G  (x.A' ,  w.A'). 

Case  4  (A  =  S,  r  =  0) 

This  case  is  similar  to  Case  3,  but  with  no  attributes  x.S  and  x.S'.  The 

replacement  K-system  fragments  are 

(A604)     S  ->  S'         w.S'  =  y.S' 

y.S  =  y.S' 

(A7.4)     S'  ->   aQ       y.S1  =  Gp(w.S'). 

Of  course  fragment  (A5.2)  will  "connect"  (A6.4)  and  (A7„4)  to  the  new 
distinguished  symbol  T. 

Noting  that  w  and  x  are  inherited  and  y  and  z  are  synthesized  attri- 
butes, it  is  easily  seen  that  all  versions  of  equations  (A3)  through  (A7)  in 
Cases  1  -  4  satisfy  conditions  (iii)  and  (iv)  of  Section  3.  By  constructing 
local  attribute  dependency  graphs  of  the  original  K-system  and  corresponding 
graphs  of  the  new  K-system,  it  can  be  readily  established  that  the  two 
systems  are  equivalent. 
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